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Abstract 

> 

r) Distributed synchronization is known to occur at several scales in 

• the brain, and has been suggested as playing a key functional role 

O in perceptual grouping. State-of-the-art visual grouping algorithms, 

however, seem to give comparatively little attention to neural syn- 
chronization analogies. Based on the framework of concurrent syn- 
chronization of dynamic systems, simple networks of neural oscilla- 



{S( tors coupled with diffusive connections are proposed to solve visual 

ON grouping problems. Multi-layer algorithms and feedback mechanisms 

• are also studied. The same algorithm is shown to achieve promising 

results on several classical visual grouping problems, including point 
qq clustering, contour integration and image segmentation. 

o 
> 

X 1 Introduction 

Consider Fig. [TJ Why do we perceive in these visual stimuli a cluster of 
points, a straight contour and a river? How is the identification performed 
between a subgroup of stimuli and the perceived objects? These classical 
questions can be addressed from a variety of point of views, both biological 
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and mathematical. This paper develops new grouping algorithms in the 
biologically-inspired framework of distributed oscillator synchronization. 

Many physiological studies, e.g. [T2], [TTJ, [23], have shown evidence of 
grouping in visual cortex. Gestalt psychology [49j [3U [20j [9], an attempt 
to formalize the laws of visual perception, addresses some grouping princi- 
ples such as proximity, good continuation and color constancy, in order to 
describe the construction of larger groups from atomic local information in 
the stimuli. 
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Figure 1: Left: a cloud of points in which a dense cluster is embedded. 
Middle: a random direction grid in which a vertical contour is embedded. 
Right: an image in which a river is embedded. 

In computer vision, various mathematical frameworks have been sug- 
gested for grouping different visual qualities. Besides some classical clus- 
tering algorithms [H] such as k-means [29j , graph-based methods have been 
proposed for point clustering [36j [40] . Geometrical grouplets [30] consider 
contour grouping in the framework of harmonic analysis. The a contrario 
school jTQl EJ E] applies probabilistic approaches to address the percep- 
tual grouping such as point clustering and contour detection. Variational 
formulations [3H [33], [I], Markov Random Fields [H] and graph cuts [ID] 
have been applied to perform image segmentation. These computer vision 
approaches have achieved important success. However, as most of them have 
been proposed under a specific motivation (and often without much interest 
for biological analogy), applications of each algorithm are usually limited in 
grouping based on one specific quality. 

In the brain, at a finer level of functional detail, the distributed synchro- 
nization known to occur at different scales has been proposed as a general 



functional mechanism for perceptual grouping EFT] . In computer vision, 
comparatively little attention has been devoted to exploiting neural-like os- 
cillators in visual grouping. Wang and his colleagues have performed very 
innovative work using oscillators for image segmentation [45l EEJ [6] and have 
extended the scheme to auditory segregation [43j [44] . They constructed 
oscillator networks with local excitatory lateral connections and a global 
inhibitory connection. Due to the large complexity, for real images a seg- 
mentation algorithm essentially without oscillators was abstracted from the 
underlying oscillatory dynamics [6] . Li has proposed elaborate visual cortex 
models with oscillators [2U ESJ [26], [27] and applied them on lattice drawings. 
Kuzmina and his colleagues [211 E2] have constructed a simple self-organized 
oscillator coupling model, and applied it on synthetic lattice images as well. 
Faugeras et al., have started studying oscillatory neural mass models in the 
contexts of natural and machine vision [11]. 

In this paper we propose a simple and general neural oscillator algorithm 
for visual grouping, based on diffusive connections [37] . We use full-state 
neural oscillator models rather then phase-based approximations. The key 
to our approach is to embed the desired grouping properties in the couplings 
between oscillators. This allows one to exploit existing results on visual 
grouping and Gestalt while at the same taking advantage of the flexibility 
and robustness afforded by synchronization mechanisms. Synchronization of 
oscillators induces perceptual grouping while desynchronization leads to seg- 
regation. Multi-layer networks with feedback are introduced. Applications 
to point clustering, contour integration, and segmentation of synthetic and 
real images are demonstrated. A recent study of stable concurrent synchro- 
nization of neural oscillators [37] provides a general analysis tool to model 
the associated nonlinear dynamics and study their convergence properties. 

Section [2] introduces a basic model of neural oscillators with diffusive cou- 
pling connections, studies its stability, and proposes a general visual grouping 
algorithm. Sections [3j [4] and [5] describe in detail the neural oscillator solu- 
tions for point clustering, contour integration and image segmentation and 
show a number of examples. Section [6] presents brief concluding remarks. 



2 Model and Algorithm 

The model is a network of neural oscillators coupled with diffusive connec- 
tions. Each oscillator is associated to an atomic element in the stimuli, for 
example a point, an orientation or a pixel. Without coupling, the oscillators 
are desynchronized and oscillate in random phases. Under diffusive coupling 
with the coupling strength appropriately tuned, they may converge to mul- 
tiple groups of synchronized elements. The synchronization of oscillators 
within each group indicates that perceptual grouping of the underlying stim- 
ulative atoms, while the desynchronization between groups suggest group 
segregation. 

The next section describes how to construct the neural oscillator networks 
and shows the stability and convergence properties of the system. A general 
visual grouping algorithm is proposed at the end. 



2.1 Neural Oscillators 

We use a modified form of FitzHugh-Nagumo neural oscillators [131 [35], s i m_ 
ilar to [28116], 

Vi = 3vi-vf-vJ + 2-Wi + Ii (1) 

W{ = c[a(l + tsxih.(/3vi)) — Wj\ (2) 

where Vi is the membrane potential of the oscillator, wi is an internal state 
variable representing gate voltage, Ii represents the external current input, 
and a, (3 and c are strictly positive constants (we use a — 12;c = 0.04;/? = 4). 
When the input Ii exceeds a certain threshold value, the neural oscillator os- 
cillates, the trace of membrane potential Vi being plotted in Fig. [2]-a. Other 
spiking oscillator models can be used similarly. In the neural oscillator net- 
works for visual grouping, each oscillator is associated to an atomic element 
in the stimuli. 



2.2 Diffusive connections 

Oscillators are coupled using diffusive connections with Gaussian-tuned gains 
to form networks. 



Let us denote by Xi = [vi,Wi] T the state vectors of the oscillators intro- 
duced in section 2.1, each with dynamics x$ = f(x^,t). A neural oscillator 



network is composed of TV oscillators, connected with diffusive coupling [46j 
ii = f(xi,t) + J^%(x j -Xj), i = l,...,iV (3) 

where k^ is the coupling strength. 

Oscillators i and j are said to be synchronized if x$ remains equal to 
Xj. Once the elements are synchronized, the coupling terms disappear, so 
that each individual elements exhibits its natural, uncoupled behavior, as 
illustrated in Fig. [2j It is intuitive to see that a larger k^ value facilitates 
and reinforces the synchronization between the oscillators i and j (refer to 
Appendix for more details). 
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Figure 2: a. a single oscillator, b. synchronization of two oscillators coupled 
through diffusive connections. The two oscillates start to be fully synchro- 
nized at about t — 30. 

The key to apply neural oscillators with diffusive connections to visual 
grouping is to tune the coupling so that the oscillators synchronize if their 
underlying atoms belong to the same visual group and desynchronize other- 
wise. According to Gestalt psychology J49J EDI EI] , visual stimulus atoms of 
similarity or proximity tend to be grouped perceptually. This suggests that 
the coupling between the neural oscillators should be reinforced if they have 
similar stimuli. Such coupling can be implemented by the Gaussian tuning 



nj 



(4) 



where Si and Sj are stimuli of the two oscillators, for example position for 
point clustering, orientation for contour integration and grey-level for image 



segmentation, and /3 is a tuning parameter. Due to its good properties such 
as smoothness, Gaussian tuning has been applied in various applications such 
as image denoising [3l [4] , segmentation [40j and recognition [39j . 



2.3 Generalized diffusive connections 

The visual cortex is hierarchical. Higher-level layers have smaller dimension 
than lower-level layers, as information going from bottom to top turns from 
redundant to sparse and from concrete to abstract 



In a feedback hierarchy, generalized diffusive connections correspond to 
achieving consensus between multiple processes of different dimensions. Im- 
plementation of the hierarchy involves connecting two or more oscillator sub- 
networks of different sizes. Oscillator networks of sizes Ni and N 2 can be 
connected with generalized diffusive connections [471 Ell : 

xi = fi(x 1 ,t) + fc 1 A T (Bx 2 -Ax 1 ) (5) 

x 2 = f 2 (x 2 ,t) + fc 2 B T (Ax 1 -Bx 2 ), (6) 

where fi and f 2 are dynamics of two networks of sizes Ni and N 2 whose 
state vectors are respectively xi and x 2 , A and B are coupling matrices of 
appropriate sizes M x Ni and M x 7V 2 , k\ and k 2 are the coupling strengths. 

For appropriate choices of dynamics, once the two layers are synchronized, 
i.e. Axi = Bx 2 , the coupling terms disappear, so that each layer exhibits 
its natural behavior as an independent network, as in the case of diffusive 
connections. 



2.4 Concurrent Synchronization and Stability 

In perception, fully synchronized elements in each group are bound, while dif- 
ferent groups are segregated. Concurrent synchronization analysis provides a 
mathematical tool to study stability and exponential convergence properties 
in this context. 

In an ensemble of dynamical elements, concurrent synchronization is de- 
fined as a regime where the whole system is divided into multiple groups 




Figure 3: Two networks (top and bottom) of different dimensions are con- 
nected with generalized diffusive connections. 



of fully synchronized elements, but elements from different groups are not 
necessarily synchronized [37J. Networks of oscillators coupled by diffusive 



connections (section |2.2) or generalized diffusive connections (section |2.3) 



are specific cases of this general framework. 

Recall that a subset of the global state space is called invariant if trajec- 
tories that start in that subset remain in that subset. In our synchronization 
context, the invariant subsets of interest are linear subspaces, corresponding 
to some components of the overall state being equal or verifying some linear 
relation. Concurrent synchronization analysis quantifies stability and conver- 
gence to invariant linear subspaces. Furthermore, a property of concurrent 
synchronization analysis, which turns out to be particularly convenient in 
the context of grouping, is that the actual invariant subset itself need not be 
know a priori to guarantee stable convergence to it. 

Finally, concurrent synchronization may first be studied in an idealized 
setting, e.g., with exactly equal inputs to groups of oscillators, and noise- free 
conditions. This allows one to compute minimum coupling gains to guarantee 
global exponential convergence to the invariant synchronization subspace. 
Robustness of concurrent synchronization, a consequence of its exponential 
converge properties, allows the qualitative behavior of the nominal model to 
be preserved even in non-ideal conditions. In particular, it can be shown 
and quantified that for high convergence rates, actual trajectories differ little 



from trajectories based on an idealized model. 

A more specific discussion of stability and convergence is given in the 
appendix. The reader is referred to [37] for more details on the analysis 
tools. 



2.5 Visual Grouping Algorithm 

The basic visual grouping algorithm proceeds in the following steps. 

1. Construct a neural oscillator network. Each oscillator is associated 
to one atom in the stimuli. Oscillators are connected with diffusive 
connections ^ or generalized diffusive connections ([5j [6]) using the 
Gaussian-tuned gains ([I]). 

2. The oscillators converge to concurrently synchronized groups in the 
so-constructed network. 

3. Identify the synchronized oscillators and equivalent ly the visual groups. 
A group of synchronized oscillators indicates that the underlying vi- 
sual stimulative atoms are perceptually grouped. Desynchronization 
between groups suggest that the underlying stimulative atoms in the 
two groups are segregated. 

Traces of synchronized oscillators coincide in time, while those of desyn- 
chronized groups are separated [42J. The identification of synchronization 
in the oscillation traces (as illustrated in the example of Fig. |I]-b) can be 
realized by thresholding the correlation of the traces or by simply applying 
a clustering algorithm such as fc-means. 

The following sections detail neural oscillator solutions for three visual 
grouping problems, namely point clustering, contour integration and image 
segmentation. 



3 Points Clustering 

The neural oscillator points clustering is based on diffusive connections ^ 
and follows directly the general algorithm in section |2.5[ Let us denote 
Yi = (xi,yi) the coordinates of a point p^. Each point p^ is associated 
to an oscillator x*. The proximity gestalt principle [49l [20j EI] suggests 
strong coupling between oscillators corresponding to proximate points. More 
precisely, the coupling strength between x^ and Xj 



is 



Hj 



-lyj-yjT 





if j e M 
else 



(7) 



where Mi is a neighborhood of p^. For example Mi can be defined as the set 
of M points closest to p^. Then ([7| couples an oscillator x^ and with its M 
nearest neighbors. The local coupling can propagate to make the coupling 
in a larger scale. Higher M value reinforces the coupling. The parameter f3 
tunes the size of the clusters one expects to detect. The external input Ii of 



the oscillators in (11) are set as uniformly distributed random variables in 



the appropriate range. 

Fig. [4] illustrates an example in which the points make clearly two clusters. 
As shown in Fig. [4]-b, the oscillator system converges to two concurrently 
synchronized groups, each corresponding to one cluster, and separated in the 
time dimension. The identification of the two groups induces the clustering 
of the underlying points, as shown in Fig. [|-c. 
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Figure 4: a. Points to cluster, b. The oscillators converge to two concur- 
rently synchronized groups, c. Clustering results. The blue circles and the 
red crosses represent the two clusters. 
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Fig. [5] presents a more challenging setting where one seeks to identify a 
cluster in a cloud of points. The cloud is made of 300 points uniformly ran- 
domly distributed in a space of size 100 x 100, in addition to a cluster of 100 
Gaussian distributed points with standard deviation equal to 3 x 3. Thanks 
to the coupling (J7J), the neural oscillator system converges to one synchro- 
nized group that corresponds to the cluster with all the "outliers" totally 
desynchronized in the background, as shown in Fig. [5]-c. The synchronized 
traces are segregated from the background (for example by thresholding the 
correlation among the traces) which results in the identification of the un- 
derlying cluster, as shown in Fig. [5}b. Fig. [5}d plots along time the number 
of the oscillators simultaneously spiking. The peaks in the trace tell the exis- 
tence of a cluster (similarly to e.g. [T81H8] ). and their amplitude (about 115) 
indicates the number of points in the cluster: the oscillators which belong to 
the cluster are synchronized and thus spike together to make a high peak. 



4 Contour Integration 



Field and his colleagues [12] have shown some interesting experiments, an 
example being illustrated in Fig. [HJ to test human capacity of contour inte- 
gration, i.e. of identifying a path within a field of randomly-oriented elements 
and made some quantitive observations in accordance with the "good con- 
tinuation" law |35l [201 [3T|: 

• Contour integration can be made when the successive elements in the 
path, i.e., the element-to-element angle (3 (see Fig. [7]), differ by 60° or 
less. 

• There is a constraint between the element-to-element angle (3 and the 
element-to-path angle a (see Fig. [7]). The visual system can integrate 
large differences in element-to-element orientation only when those dif- 
ferences lie along a smooth path, i.e., only when the element-to-path 
angle a is small enough. For example, with a = 15° and f3 — 0° the con- 
tour integration is difficult, even though the observers can easily track 
a 15 degree orientation difference when there is no variation (a = 0° 
and/?= 15°). 
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Figure 5: a. A cloud of points made of 300 points uniformly randomly dis- 
tributed in a space of size 100 x 100, in addition to a cluster of 100 Gaussian 
distributed points with standard deviation equal to 3 x 3. b. Blue dots repre- 
sent the cluster detected by the algorithm and red crosses are the "outliers" . 
c. The neural oscillator system converges to one synchronized group that 
corresponds to the cluster with all the "outliers" totally desynchronized in 
the background, d. The number of oscillators simultaneously spiking. The 
peaks in the trace tell the existence of a cluster and their amplitude (about 
115) indicates the number of points in the cluster. 
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Figure 6: The left-hand panel shows the path of elements (the stimulus) that 
the subjects must detect when embedded in an array of randomly oriented 
elements (the stimulus plus background shown on the right). The stimulus 
consisted of 12 elements aligned along a path. In this example each successive 
element differs in orientation by 30° and for this difference in orientation the 
string of aligned elements is easily detected. This figure is cited from [T2] . 




Figure 7: The element-to-element angle j3 is the difference in angle of orien- 
tation of each successive path segment. The element-to-path angle a is the 
angle of orientation of the element with respect to the path. 
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Fig. [8] shows the setting of the contour integration experiments. An ori- 
entation value Oi e [0, 2n) is defined for each point i = (i,j) in a grid as 
illustrated by the flashes. The proposed algorithm detects the smooth con- 
tours potentially imbedded in the grid. 



Following the general visual grouping algorithm described in section [23 , 
neural oscillators with diffusive connections ^ are used to perform contour 
integration. Each orientation in the grid is associated to one oscillator. The 
coupling of the oscillators i and j follows the Gestalt law of "good continua- 
tion" and, in particular, the results of the psychovisual experiments of Field 
et al P2|: 

pxn ( l Q J- Q jl 2 _ l^-°ul 2 > \ if i: _ :i < w 



where 



°ij = 



if |i — j| > w 



arctan(i — j) if | arctan(i — j) — |Qi 2 ° jl | < | arctan(i — j) + n — 2 ° j | 
arctan(i — j) + n else 



is the undirectional orientation of the path ij (the closer to the average 
element-to-element orientation |Qi 2 modulo n). By making constraints on 
the element-to-element angle j3 (the first term in (J8|)) and the element-to- 
path angle a (the second term in (|8j)) 3 the neural oscillator system makes 
smooth contour integration. 5 and 7 tune the smoothness of the detected 
contour. As contour integration is known to be rather local [12], the coupling 
(|8|) is effective within a neighborhood of size (2w + 1) x (2w + 1). 

The example illustrated in Fig. |8]-a presents a grid in which orientations 
are uniformly distributed in space, except for one vertical contour. The 
orientation of the elements on the vertical contour undertakes furthermore a 
Gaussian perturbation of standard deviation a = 10°. The neural oscillator 
system converges to one synchronized group that corresponds to the contour 
with all the other oscillators desynchronized, as illustrated in Fig. |8]-c. The 
synchronized group is segregated by thresholding the correlation among the 
traces. This results in the "pop-out" of the contour, as shown in Fig. [8]-b. 
The parameters are configured as S = 20°, 7 = 10° and w = 1, which are 
in line with the results of the psychovisual experiments of Field et al [12J. 
Fig. [9] illustrates a similar example with two intersected straight contours. 

13 





Figure 8: Left: A vertical contour is embedded in a uniformly distributed 
orientation grid. Middle: The detected contour. Right: the traces of the 
neural oscillation. 
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Figure 9: Left: Two contours are embedded in a uniformly distributed ori- 
entation grid. Middle and right: the two identified contours. 



14 



Fig. [TD]-a illustrates a smooth curve embedded in the uniformly randomly 
distributed orientation background. With some minor effort, subjects are 
able to identify the curve due to its "good continuation". Similarly the 
neural system segregates the curve from the background with the oscillators 



lying on the curve fully synchronized, as illustrated in Fig. 10 -b. 
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Figure 10: Left: A smooth curve is embedded in a uniformly distributed 
orientation grid. Middle: The detected curve. 



5 Image Segmentation 



The proposed image segmentation scheme is based on concurrent synchro- 
nization |37j and follows the general visual grouping algorithm described 
In the basic version, the coupling gain between oscillators 



in section |2.5| 

are again inspired directly from more standard techniques, namely non-local 
grouping as applied e.g. to in image denoising [3], |4] in addition to the gestalt 
laws. Multi-layer neural networks and feedback mechanisms are then intro- 
duced to reinforce robustness under strong noise perturbation and to ag- 
gregate the grouping. Experiments on both synthetic and real images are 
shown. 
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5.1 Basic Image Segmentation 

One oscillator is associated to each pixel in the image. Within a neighborhood 
the oscillators are non-locally coupled with a coupling strength 



rv tJ 







if |i — jl < w 
otherwise 



(9) 



where u\ is the pixel gray-level at coordinates i = (i,j) and w adjusts the 
size of the neighborhood. Pixels with similar grey-levels are coupled more 
tightly, as suggested by the color constancy gestalt law [49l [20j EI] . Non-local 
coupling plays an important role in regularizing the image segmentation, with 
a larger w resulting in more regularized segmentation and higher robustness 
to noise. 



Fig. 
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-a illustrates a synthetic image (the gray-levels of the black, gray 
and white parts are 0, 128, and 255) contaminated by white Gaussian noise 
of moderate standard deviation a = 10. The segmentation algorithm was 
configured with j3 = a and w = 5. The oscillators converge into three 



concurrently synchronized groups as plotted in Fig. [TI}b which results in a 
perfect segmentation as shown in Fig. [TT]-c. 






Figure 11: a. A synthetic image (the gray-levels of the black, gray and 
white parts are respectively 0, 128, 255) contaminated by white Gaussian 
noise of standard deviation a = 10. b. The traces of the neural oscillation. 
The oscillators converge into three concurrently synchronized groups, c. 
Segmentation result. 



Fig. [12] show some natural image segmentation examples. The segmen- 
tation results are rather regular with hardly any "salt and pepper" holes, 
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thanks to the diffusive coupling. A sagittal MRI (Magnetic Resonance Imag- 



ing) image in Fig. 12-a is segmented in 15 classes, the segmentation results 
shown in Fig. [T2]-b. Salient regions such as cortex, cerebellum and lateral 



ventricle are segregated with good accuracy. Fig. [12]-c is a radar image in 
which boundaries are blurred. In the segmentation results with 20 classes as 



shown in Fig.[12]-d, the image, including the eye of the hurricane, is accurately 
segregated. 







Figure 12: Real image segmentation, a and b. A sagittal MRI image of size 
128 x 128 and the segmentation result in 15 classes, c and d. A radar image 
of size 128 x 128 and the segmentation result in 20 classes. 



17 



5.2 Noisy Image Segmentation with Feedback 



Fig. [13]-a shows a synthetic image contaminated by strong white Gaussian 
noise of standard deviation a = 40. Thanks to the non-local coupling, the 
neural oscillator network is robust and segmentation results (Fig. fl3[b) are 



more regular than those of algorithms which do not take into account the 



image regularity prior, such as e.g. fc-means (Fig. 13 -f). However, due to the 
heavy noise perturbation, the segmentation result is not perfect. A feedback 
scheme is introduced to overcome this problem. Specifically, in a loose anal- 
ogy with the visual cortex hierarchy, a second layer is introduced to reflect 
prior knowledge, in this case that proximate pixels of natural images are 
likely to belong to the same region. Feedback from the second layer to the 
first exploits this regularity to increase robustness to noise. 

The feedback error correction mechanism is implemented using the gen- 
eralized diffusive connections introduced in Section |2.3[ On top of the first 
layer previously described, a second layer of oscillators is added and cou- 
pled with the first layer, as shown in Fig. [T4j The second layer contains 
M oscillators, where M is the number of regions obtained in the first layer, 
the input of each oscillator being the average gray-level of an image region. 
Each oscillator in the first layer, indexed by (i, j), is coupled with all the 
M oscillators in the second layer, with the coupling strengths depending on 
the segmentation obtained in a neighborhood of (i,j). More precisely, the 
coupling matrices A and B in the generalized diffusive connection ^ are de- 
signed so that the coupling strength from an oscillator x 2 (m), m — 1, . . . , M, 
in the second layer to an oscillator Xi(z,j) in the first layer is proportional 
to number of pixels in the neighborhood N{i,j) which belong to the region 
m according to the current segmentation — as illustrated in Fig. [14} This 
inter-layer connection reinforces the coupling between the first-layer oscilla- 
tors and the second-layer oscillators which correspond to the locally dominant 
regions, and thus regularizes the segmentation. After each step of A and B 
adjustment, the oscillator network converges to a new concurrently synchro- 
nized status and the segmentation is updated. Figs. [l3]-b to [l3]-e show the 
segmentation results at the beginning of the 4th, 6th, 8th and 10th oscilla- 
tion periods. The segmentation errors are corrected as the feedback is going 



along. The segmentation is stable at Fig. 13-e after the 10th period. 



Fig. 15 -a illustrates an infrared night vision image heavily noised. The 
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segmentation result of the basic algorithm without feedback shown in Fig. [15 
b contains a few punctual errors and, more importantly, the contour of the 
segmented objected zigzags due to the strong noise perturbation. As illus- 
trated in Fig. [l5]-c, the feedback procedure corrected the punctual errors and 
regularized the contour. 



| ■ 







Figure 13: a. A synthetic image (the gray-levels of the black, gray and 
white parts are respectively 0, 128, 255) contaminated by white Gaussian 
noise of standard deviation a = 40. be. Segmentation by neural oscillators 
with feedback, shown at the 4th, 6th, 8th and 10th oscillation periods, f. 
Segmentation by fc-means. 



5.3 Multi-layer Image Segmentation 



The visual cortex is hierarchical, with cells at the lower levels having smaller 
reception fields than the ones at the higher levels and information aggregating 
from bottom to top J38J HE]. This structure can be imitated by multi-layer 
oscillator networks. 



Fig. 16 illustrates a two-layer image segmentation scheme. The image on 
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Figure 14: Neural network with feedback for image segmentation. In the first 
layer one oscillator is associated to each pixel. An oscillator is coupled within 
all the others in a neighborhood of size (2w + l) x (2w + l) (for clarity only the 
coupling with the 4 nearest neighbors are shown in the figure). The image 
is segmented into two regions marked by white and gray. The second layer 
contains two oscillators whose input are respectively the average gray-level of 
the two image regions. The coupling strength between an oscillator indexed 
by the coordinates (i,j) in the first layer and an oscillator indexed by the 
regions m = 1, 2 in the second layer is proportional to number of the pixels 
in the neighborhood N{i,j) = {(hj), (i ± 1, j), (i,j ± 1)} that belong to the 
region m according to the previous segmentation. Only five such couplings 
and their strengths are shown for clarity . 




Figure 15: Infrared image segmentation, a. Infrared image, b. Segmentation 
without feedback, c. Segmentation with feedback. 
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Figure 16: Two-layer image segmentation. 



the first layer is decomposed into four disjoint parts. Each part is treated as 
an independent image where a basic image segmentation is performed (in this 
symbolic example, each part is segmented into two regions.) The second layer 
aggregates the segmentation results obtained on the first layer, using a single 
oscillator to represent each resulting region, with the average gray-level in the 
region as the oscillator input. The coupling connections on the second layer 
follow the topology of the regions on the first layer: second-layer oscillators 
whose underlying regions in the first layer are in vicinity are coupled. The 
segmentation on the second layer merges some regions obtained in the first 
layer and provides the final segmentation result. (In this example, regions A 2) 
£>2, C2 and D 2 are merged on the second layer.) Multi-layer segmentation 
can follow the same principle. From a computational point of view, the 
multi-layer scheme saves memory and accelerates the program. 



Fig. [17] illustrate an example of the multi-layer image segmentation. The 
image of size 256 x 256 shown in Fig. [T7]-a is decomposed into four 128 x 128 



parts. Fig. [17]- b and c present respectively the segmentation results of the first 
and second layers, both with 6 classes. From the first layer to the second, 
some regions that belong to different parts are merged, which eliminates 
some boundary artifacts in the first layer. The merging of a number of 
regions belonging to the same part, on the other hand, contributes to the 



correction of some over-segmentation. The segmentation result in Fig. 17 
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c is satisfactory. The river, although rather noisy in the original image, is 
segmented from the land with accurate boundary. The two aspects of the 
land are accurately segregated as well. 
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Figure 17: Multi-layer image segmentation, a. Aerial image (256 x 256). b. 
Segmentation result of the first layer. Each of the four 128 x 128 parts is 
segmented in 6 classes, c. Segmentation result in 6 classes with the second 
layer. 



6 Concluding Remarks 

Inspired by neural synchronization mechanisms for perceptual grouping, sim- 
ple networks of neural oscillators coupled with diffusive connections have been 
proposed to solve visual grouping problems. Stable multi-layer algorithms 
and feedback mechanisms have also been studied. The same algorithm has 
been shown to achieve promising results on several classical visual grouping 
problems, including point clustering, contour integration and image segmen- 
tation. 

Appendix: Convergence and Stability 
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One can verify (|37j, to which the reader is referred for more details on the 
analysis tools) that a sufficient condition for global exponential concurrent 
synchronization of an oscillator network is 



A 



mm 



(VLV T ) > su P A max (^-(^t)) , (10) 

a,t V^ X / 



where A m i n (A) and A max (A) are respectively the smallest and largest eigen- 
values of the symmetric matrix A s = (A + A T )/2, L is the Laplacian matrix 
of the network (L^ = Ylj^i %> -^u = — % f° r J ^ an ^ V is a projection 
matrix on A^ 1 . Here A4^~ is the subspace orthogonal to the subspace Ai in 
which all the oscillators are in synchrony — or, more generally in the case of a 
hierarchy, where all oscillators at each level of the hierarchy are in synchrony 
(i.e., xi = • • • = x^v at each level). Note that A4 itself need not be invariant 
(i.e. all oscillators synchronized at each level of the hierarchy need not be a 
particular solution of the system) , but only needs to be a subspace of the ac- 
tual invariant synchronization subspace ([15], [37] section 3.3.i.), which may 
consist of synchronized subgroups according to the input image. Indeed, the 
space where all x^ are equal (or, in the case of a hierarchy, where at each 
level all x^ are equal), while in general not invariant, is always a subspace of 
the actual invariant subspace corresponding to synchronized subgroups. 

These results can be applied e.g. to individual oscillator dynamics of the 



type (11). Let J^ denote the Jacobian matrix of the individual oscillator 



dynamics (11), J^ = d[i)i Wi] T /d[vi,Wi] T . Using a diagonal metric trans- 
formation = diag(y / ca^, 1), one easily shows, similarly to [46j [47], that 
the transformed Jacobian matrix J^O -1 — diag(/c, 0) is negative definite for 



k > 3+^. More general forms of oscillators can also be used. For instance, 
other second-order models can be created based on a smooth function /(.) 
and an arbitrary sigmoid-like function <r(.) > such that < a f {.) < 1 , in 
the form 

Vi = f(Vi) -Wi + Ii (11) 

Wi = c[aa(/3vi) - Wi] (12) 

with the transformed Jacobian matrix negative definite for k > f{vj) + ^f- . 

From a stability analysis point of view, the coupling matrix K of the 
Gaussian-tuned coupling, composed of coupling coefficients k^ in Eq.Q, 
presents desirable properties. It is symmetric, and the classical theorem of 
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Schoenberg (see J32]) shows that it is positive definite. Also, although it may 
actually be state-dependent, the coupling matrix K can always be treated 
simply as a time- varying external variable for stability analysis and Jacobian 
computation purposes, as detailed in ([46], section 4.4, [37], section 2.3). 

Last, note that more generally the individual dynamics need not all be 
oscillators. In particular, memory-like or voting-like subdynamics could be 
introduced in a feedback hierarchy, by having the corresponding f^(x^,t) be 
the gradient of a scalar function of x^ with multiple local minima. 
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